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TRANSCRIPTIONAL ADAPTOR PROTEIN 



FIELD OF THE INVENTION 

The present invention relates generally to a 
5 transcriptional adaptor protein, the gene encoding the 
protein and uses thereof . 



BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features and advantages of this 
10 invention will be evident from the following detailed 
description of preferred embodiments when read in 
conjunction with the accompanying drawings in which: 

Figure 1A and B shows a Northern Blot Analyses of 
Total Cellular RNA obtained from uninduced, hemin, or TPA 
15 induced K562 cells. Figure 1A: Probes SA25, SA 21, SA11 
were obtained as a partial cDNA fragments from the K562 
TPA-Hemin subtracted cDNA library. SA25 hybridized to a 
novel 3.7 kb transcript TIG-1. SA21 hybridized to the 
TIMP-1 mRNA. SA11 hybridized to an EST of unknown 
2 0 function. Figure IB: Fold induction of TIG-1, TIMP-1, 
and R35565 mRNAs with mRNAs obtained from hemin induced 
K562 cells taken as 1. Amounts of mRNA expression were 
normalized to G3PDH mRNA expression under each inducing 
condition. 

25 Figure 2A and B shows a Northern Blot Analysis of 

TPA induced K562 mRNA demonstrating time course of TIG-1 
mRNA expression. Figure 2A: Representative northern 
blot using a partial cDNA fragment of TIG-1 as probe. 
Figure 2B: Bar graph of fold induction of TIG-1 mRNA at 

30 various time points after TPA induction of K562 cells. 

Fold induction was normalized to expression of G3PDH mRNA 
at each time point. 

Figure 3A and B shows a Northern Blot Analysis of 
the expression of TIG-1 mRNA in various tissues: Figure 

35 3A: Representative Northern Blots of Tissues. Each lane 



contains 2 fig of poly A + mRNA. Figure 3B : Bar graph 
showing relative expression of TIG-1 mRNA in each tissue 
with 1 taken as level of expression in the kidney. Each 
bar represents the average of two independent 
5 determinations of TIG-1 expression. TIG-1 mRNA 

expression was normalized to S-actin mRNA expression in 
each tissue. 

Figure 4 illustrates the sequence of the TIG-1 cDNA : 
The deduced open reading frame of the TIG-1 cDNA is 

10 shown. Underlined is the putative bipartite nuclear 

localization signal (RRMINKIDKNEDRKK) . The three circles 
underneath TAR represent a putative protein kinase C 
phosphorylation site. The sequence GSSQAE is a putative 
casein kinase II phosphorylation site. The filled in 

15 bars below the sequences NVSS and NFSV represent 
potential N-glycosylation sites. 

Figure 5 shows a representation of the amino acid 
sequence of TIG-1 protein: The glutamine and 
serine/proline rich domains are noted. The glutamine 

2 0 rich repeat sequences are shown. The putative nuclear 

localization signal is shown in black. 

Figure 6A shows a SDS-PAGE of in vitro translation 
products obtained from TIG-1 transcripts. Lane 4 
contains translation products of TIG-1 sense transcripts. 
25 Lane 5 contains translation products of TIG-1 antisense 
transcripts. Lanes 2 and 3 contains translation products 
of NF-E2 sense and antisense transcripts respectively. 
Figure 6B shows a Western Blot analysis of K562 
cytoplasmic and nuclear protein extracts. Each lane 

3 0 contains 50 fig of protein. Lanes 1 and 2 were blotted 

with preimmune serum which had been affinity purified 
with the carboxy terminal amino acids of TIG-1 protein. 
Lanes 3 and 4 were blotted with the supernatant fraction 
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of the affinity purified preimmune serum. Lanes 5 and 6 
were blotted with post-immune serum affinity purified 
with the carboxy terminal amino acids of TIG-1 protein. 

Figure 7A-C is a Transient Transfection analysis of 
5 effect GAL4:TIG-1 fusion protein on CAT reporter gene 
expression. Figure 7A shows a representation of the 
structure of the GAL4:TIG-1 fusion proteins and CAT gene 
reporter plasmid construct. Figure 7B shows a 
representative CAT assay following 4 8 hours of transient 

10 expression of the GAL4 .- TIG-1 fusion proteins. Figure 7C 
shows a bar graph showing mean and standard deviation of 
fold induction of CAT activity in each cell lysate. The 
fold induction is relative to the CAT activity of 
uninduced K562 cells. Differences in transfection 

15 efficiency were corrected for by cotransf ection with 
pCMVSgal expression plasmid. 

DETAILED DESCRIPTION OF THE INVENTION 

2 0 Throughout this application various 

publications are referenced, many in parenthesis. Full 
citations for each of these publications are provided at 
the end of the Detailed Description and throughout the 
Detailed Description. The disclosures of each of these 
25 publications in their entireties are hereby incorporated 
by reference in this application. 

The term "nucleic acid", as used herein, refers 
to either DNA or RNA. "Nucleic acid sequence" or 
"polynucleotide sequence" refers to a single- or 

3 0 double -stranded polymer of deoxyribonucleotide or 

ribonucleotide bases read from the 5' to the 3' end. It 
includes both self -replicating plasmids, infectious 
polymers of DNA or RNA, and nonfunctional DNA or RNA. 
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"Isolated" nucleic acid refers to nucleic acid which 
has been separated from an organism in a substantially 
purified form {i.e. substantially free of other 
substances originating from that organism) , and to 
5 synthetic nucleic acid. 

By a nucleic acid sequence "homologous to" or 
"complementary to", it is meant a nucleic acid that 
selectively hybridizes, duplexes or binds to DNA 
sequences encoding the protein or portions thereof when 

10 the DNA sequences encoding the protein are present in a 
genomic or cDNA library. A DNA sequence which is similar 
or complementary to a target sequence can include 
sequences which are shorter or longer than the target 
sequence so long as they meet the functional test set 

15 forth. 

Typically, the hybridization is done in a Southern 
blot protocol using a 0 . 2X SSC, 0.1% SDS, 65°C wash. The 
term "SSC" refers to a citrate-saline solution of 0.15M 
sodium chloride and 20 mM sodium citrate. Solutions are 

20 often expressed as multiples or fractions of this 

concentration. For example, 6X SSC refers to a solution 
having a sodium chloride and sodium citrate concentration 
of 6 times this amount or 0.9 M sodium chloride and 120 
mM sodium citrate. 0 . 2X SSC refers to a solution 0.2 

25 times the SSC concentration or 0.03M sodium chloride and 
4 mM sodium citrate. 

The phrase "nucleic acid molecule encoding" refers 
to a nucleic acid molecule which directs the expression 
of a specific protein or peptide. The nucleic acid 

3 0 sequences include both the DNA strand sequence that is 
transcribed into RNA and the RNA sequence that is 
translated into protein or peptide. The nucleic acid 
molecule includes both the full length nucleic acid 
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sequences as well as non-full length sequences derived 
from the full length protein. It being further 
understood that the sequence includes the degenerate 
codons of the native sequence or sequences which may be 
5 introduced to provide codon preference in a specific host 
cell. 

The term "located upstream" as used herein refers to 
linkage of a promoter upstream from a nucleic acid (DNA) 
sequence such that the promoter mediates transcription of 

10 the nucleic acid (DNA) sequence. 

The term "vector", refers to viral expression 
systems, autonomous self -replicating circular DNA 
(plasmids) , and includes both expression and 
nonexpression plasmids . Where a recombinant 

15 microorganism or cell is described as hosting an 

"expression vector, " this includes both extrachromosomal 
circular DNA and DNA that has been incorporated into the 
host chromosome (s) . Where a vector is being maintained 
by a host cell, the vector may either be stabley 

20 replicated by the cells during mitosis as an autonomous 
structure, or the vector may be incorporated within the 
host ' s genome . 

The term "plasmid" refers to an autonomous circular 
DNA molecule capable of replication in a cell, and 

25 includes both the expression and nonexpression types. 

Where a recombinant microorganism or cell is described as 
hosting an "expression plasmid", this includes latent 
viral DNA integrated into the host chromosome ( s ) . Where 
a plasmid is being maintained by a host cell, the plasmid 

3 0 is either being stabley replicated by the cell during 
mitosis as an autonomous structure, or the plasmid is 
incorporated within the host ' s genome . 
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The phrase "heterologous protein" or " recombinant ly 
produced heterologous protein" refers to a peptide or 
protein of interest produced using cells that do not have 
an endogenous copy of DNA able to express the peptide or 
5 protein of interest. The cells produce the peptide or 
protein because they have been genetically altered by the 
introduction of the appropriate nucleic acid sequences. 
The recombinant peptide or protein will not be found in 
association with peptides or proteins and other 

10 subcellular components normally associated with the cells 
producing the peptide or protein. 

The following terms are used to describe the 
sequence relationships between two or more nucleic acid 
molecules or polynucleotides, or between two or more 

15 amino acid sequences of peptides or proteins: "reference 
sequence", "comparison window", "sequence identity", 
"sequence homology", "percentage of sequence identity", 
"percentage of sequence homology", "substantial 
identity", and "substantial homology". A "reference 

2 0 sequence" is a defined sequence used as a basis for a 

sequence comparison; a reference sequence may be a subset 
of a larger sequence, for example, as a segment of a 
full-length cDNA or gene sequence given in a sequence 
listing or may comprise a complete cDNA or gene sequence. 

25 Optimal alignment of sequences for aligning a 

comparison window may be conducted, for example, by the 
local homology algorithm of Smith and Waterman (1981) , by 
the homology alignment algorithm of Needleman and Wunsch 
(1970) , by the search for similarity method of Pearson 

30 and Lipman (1988) , or by computerized implementations of 
these algorithms (GAP , BESTFIT, FASTA, and TFASTA in the 
Wisconsin Genetics Software Package Release 7.0, Genetics 
Computer Group, 575 Science Dr., Madison, Wis.). 



As applied to nucleic acid molecules or 
polynucleotides, the terms "substantial identity" or 
"substantial sequence identity" mean that two nucleic 
acid sequences, when optimally aligned (see above) , share 
5 at least 90 percent sequence identity, preferably at 
least 95 percent sequence identity, more preferably at 
least 96, 97, 98 or 99 percent sequence identity. 

"Percentage nucleotide (or nucleic acid) identity" 
or "percentage nucleotide (or nucleic acid) sequence 

10 identity" refers to a comparison of the nucleotides of 

two nucleic acid molecules which, when optimally aligned, 
have approximately the designated percentage of the same 
nucleotides. For example, "95% nucleotide identity" 
refers to a comparison of the nucleotides of two nucleic 

15 acid molecules which when optimally aligned have 95% 
nucleotide identity. Preferably, nucleotide positions 
which are not identical differ by redundant nucleotide 
substitutions (the nucleotide substitution does not 
change the amino acid encoded by the particular codon) . 

20 As further applied to nucleic acid molecules or 

polynucleotides, the terms "substantial homology" or 
"substantial sequence homology" mean that two nucleic 
acid sequences, when optimally aligned (see above) , share 
at least 90 percent sequence homology, preferably at 

25 least 95 percent sequence homology, more preferably at 
least 96, 97, 98 or 99 percent sequence homology. 

"Percentage nucleotide (or nucleic acid) homology" 
or "percentage nucleotide (or nucleic acid) sequence 
homology" refers to a comparison of the nucleotides of 

30 two nucleic acid molecules which, when optimally aligned, 
have approximately the designated percentage of the same 
nucleotides or nucleotides which are not identical but 
differ by redundant nucleotide substitutions (the 



- 8 - 

nucleotide substitution does not change the amino acid 
encoded by the particular codon) . For example, "95% 
nucleotide homology" refers to a comparison of the 
nucleotides of two nucleic acid molecules which when 
5 optimally aligned have 95% nucleotide homology. 

As applied to polypeptides, the terms "substantial 
identity" or "substantial sequence identity" mean that 
two peptide sequences, when optimally aligned, such as by 
the programs GAP or BESTFIT using default gap, share at 

10 least 90 percent sequence identity, preferably at least 
95 percent sequence identity, more preferably at least 
96, 97, 98 or 99 percent sequence identity. 

"Percentage amino acid identity" or "percentage 
amino acid sequence identity" refers to a comparison of 

15 the amino acids of two polypeptides which, when optimally 
aligned, have approximately the designated percentage of 
the same amino acids. For example, "95% amino acid 
identity" refers to a comparison of the amino acids of 
two polypeptides which when optimally aligned have 95% 

20 amino acid identity. Preferably, residue positions which 
are not identical differ by conservative amino acid 
substitutions. For example, the substitution of amino 
acids having similar chemical properties such as charge 
or polarity are not likely to affect the properties of a 

25 protein. Examples include glutamine for asparagine or 
glutamic acid for aspartic acid. 

As further applied to polypeptides, the terms 
"substantial homology" or "substantial sequence homology" 
mean that two peptide sequences, when optimally aligned, 

3 0 such as by the programs GAP or BESTFIT using default gap, 
share at least 9 0 percent sequence homology, preferably 
at least 95 percent sequence homology, more preferably at 
least 96, 97, 98 or 99 percent sequence homology. 
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"Percentage amino acid homology" or "percentage 
amino acid sequence homology" refers to a comparison of 
the amino acids of two polypeptides which, when optimally 
aligned, have approximately the designated percentage of 
5 the same amino acids or conservatively substituted amino 
acids. For example, "95% amino acid homology" refers to 
a comparison of the amino acids of two polypeptides which 
when optimally aligned have 95% amino acid homology. As 
used herein, homology refers to identical amino acids or 

10 residue positions which are not identical but differ only 
by conservative amino acid substitutions. For example, 
the substitution of amino acids having similar chemical 
properties such as charge or polarity are not likely to 
affect the properties of a protein. Examples include 

15 glutamine for asparagine or glutamic acid for aspartic 
acid. 

The phrase "substantially purified" or "isolated" 
when referring to a protein (or peptide) , means a 
chemical composition which is essentially free of other 

20 cellular components. It is preferably in a homogeneous 
state although it can be in either a dry or aqueous 
solution. Purity and homogeneity are typically 
determined using analytical chemistry techniques such as 
polyacrylamide gel electrophoresis or high performance 

25 liquid chromatography. A protein (or peptide) which is 
the predominant species present in a preparation is 
substantially purified. Generally, a substantially 
purified or isolated protein (or peptide) will comprise 
more than 8 0% of all macromolecular species present in 

30 the preparation. Preferably, the protein (or peptide) is 
purified to represent greater than 90% of all 
macromolecular species present . More preferably the 
protein (or peptide) is purified to greater than 95%, and 



most preferably the protein (or peptide) is purified to 
essential homogeneity, wherein other macromolecular 
species are not detected by conventional techniques. As 
used herein, a "substantially purified" or "isolated" 
5 protein (or peptide) can be synthetically or chemically 
produced, or recombinantly produced. A "substantially 
purified" or "isolated" protein or peptide as used herein 
is not intended to include a protein or peptide separated 
from an organism. 

10 "Biological sample" or "sample" as used herein 

refers to any sample obtained from a living organism or 
from an organism that has died. Examples of biological 
samples include body fluids and tissue specimens. 

High stringent hybridization conditions are selected 

15 at about 5°C lower than the thermal melting point (Tm) 

for the specific sequence at a defined ionic strength and 
pH. The Tm is the temperature (under defined ionic 
strength and pH) at which 5 0% of the target sequence 
hybridizes to a perfectly matched probe. Typically, 

20 stringent conditions will be those in which the salt 
concentration is at least about 0.02 molar at pH 7 and 
the temperature is at least about 60°C. As other factors 
may significantly affect the stringency of hybridization, 
including, among others, base composition and size of the 

25 complementary strands, the presence of organic solvents, 
i.e. salt or formamide concentration, and the extent of 
base mismatching, the combination of parameters is more 
important than the absolute measure of any one . High 
stringency may be attained, for example, by overnight 

30 hybridization at about 68°C in a 6X SSC solution, washing 
at room temperature with 6X SSC solution, followed by 
washing at about 68°C in a 6X SSC solution then in a 0 . 6X 
SSX solution. 
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Hybridization with moderate stringency may be 
attained, for example, by: 1) filter pre -hybridizing and 
hybridizing with a solution of 3X sodium chloride, sodium 
citrate (SSC) , 50% formamide, 0 . 1M This buffer at pH 7.5, 
5 5X Denhardt ' s solution; 2) pre-hybridization at 37°C for 
4 hours; 3) hybridization at 37°C with amount of labeled 
probe equal to 3,000,000 cpm total for 16 hours; 4) wash 
in 2X SSC and 0.1% SDS solution; 5) wash 4X for 1 minute 
each at room temperature and 4X at 60°C for 3 0 minutes 

10 each; and 6) dry and expose to film. 

The phrase "selectively hybridizing to" refers to a 
nucleic acid molecule that hybridizes, duplexes or binds 
only to a particular target DNA or RNA sequence when the 
target sequences are present in a preparation of total 

15 cellular DNA or RNA. By selectively hybridizing it is 
meant that a nucleic acid molecule binds to a given 
target in a manner that is detectable in a different 
manner from non- target sequence under moderate, or more 
preferably under high, stringency conditions of 

20 hybridization. "Complementary" or "target" nucleic acid 
sequences refer to those nucleic acid sequences which 
selectively hybridize to a nucleic acid molecule. Proper 
annealing conditions depend, for example, upon a nucleic 
acid molecule's length, base composition, and the number 

25 of mismatches and their position on the molecule, and 

must often be determined empirically. For discussions of 
nucleic acid molecule (probe) design and annealing 
conditions, see, for example, Sambrook et al . 1989. 

It will be readily understood by those skilled in 

3 0 the art and it is intended here, that when reference is 
made to particular sequence listings, such reference 
includes sequences which substantially correspond to its 
complementary sequence and those described including 
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allowances for minor sequencing errors, single base 
changes, deletions, substitutions and the like, such that 
any such sequence variation corresponds to the nucleic 
acid sequence of the peptide/protein to which the 
5 relevant sequence listing relates. 

The DNA molecules of the subject invention also 
include DMA molecules coding for protein analogs, 
fragments or derivatives of the protein which differ from 
naturally-occurring forms (the naturally- occurring 

10 protein) in terms of the identity or location of one or 
more amino acid residues (deletion analogs containing 
less than all of the residues specified for the protein, 
substitution analogs wherein one or more residues 
specified are replaced by other residues, and addition 

15 analogs wherein one or more amino acid residues are added 
to a terminal or medial portion of the protein) and which 
share the function of the naturally-occurring form. 
These molecules include: the incorporation of codons 
"preferred" for expression by selected non-mammalian 

20 hosts; the provision of sites for cleavage by restriction 
endonuclease enzymes; and the provision of additional 
initial, terminal or intermediate DNA sequences that 
facilitate construction of readily expressed vectors. 

As used herein, a "peptide" refers to an amino acid 

25 sequence of three to one hundred amino acids, and 

therefore an isolated peptide that comprises an amino 
acid sequence is not intended to cover amino acid 
sequences of greater than 100 amino acids. Preferably, 
the peptides that can be identified and used in 

3 0 accordance with the subject invention (whether they be 
mimotope or anti -mimotope peptides) are less than 50 
amino acids in length, and more preferably the peptides 
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are five to 2 0 amino acids in length or 20-40 amino acids 
in length. 

The peptides can contain any naturally-occurring or 
non-naturally-occurring amino acids, including the D-form 
5 of the amino acids, amino acid derivatives and amino acid 
mimics, so long as the desired function and activity of 
the peptide is maintained. The choice of including an 
(L) - or a (D) -amino acid in the peptides depends, in 
part, on the desired characteristics of the peptide. For 

10 example, the incorporation of one or more (D) -amino acids 
can confer increased stability on the peptide and can 
allow a peptide to remain active in the body for an 
extended period of time. The incorporation of one or 
more (D) -amino acids can also increase or decrease the 

15 pharmacological activity of the peptide. 

The peptides may also be cyclized, since cyclization 
may provide the peptides with superior properties over 
their linear counterparts. 

As used herein, the terms "amino acid mimic" and 

20 "mimetic" mean an amino acid analog or non-amino acid 
moiety that has the same or similar functional 
characteristic of a given amino acid. For instance, an 
amino acid mimic of a hydrophobic amino acid is one which 
is non-polar and retains hydrophobicity, generally by way 

25 of containing an aliphatic chemical group. By way of 
further example, an arginine mimic can be an analog of 
arginine which contains a side chain having a positive 
charge at physiological pH, as is characteristic of the 
guanidinium side chain reactive group of arginine. 

3 0 In addition, modifications to the peptide backbone 

and peptide bonds thereof are also encompassed within the 
scope of amino acid mimic or mimetic. Such modifications 
can be made to the amino acid, derivative thereof, 



non-amino acid moiety or the peptide either before or 
after the amino acid, derivative thereof or non-amino 
acid moiety is incorporated into the peptide. What is 
critical is that such modifications mimic the peptide 
5 backbone and bonds which make up the same and have 

substantially the same spacial arrangement and distance 
as is typical for traditional peptide bonds and 
backbones. An example of one such modification is the 
reduction of the carbonyl(s) of the amide peptide 

10 backbone to an amine. A number of reagents are available 
and well known for the reduction of amides to amines such 
as those disclosed in Wann et al . , JOC, 46:257 (1981) and 
Raucher et al., Tetrahedron. Lett., 21:14061 (1980). An 
amino acid mimic is, therefor, an organic molecule that 

15 retains the similar amino acid pharmacophore groups as is 
present in the corresponding amino acid and which 
exhibits substantially the same spatial arrangement 
between functional groups. 

The substitution of amino acids by non-naturally 

2 0 occurring amino acids and amino acid mimics as described 
above can enhance the overall activity or properties of 
an individual peptide based on the modifications to the 
backbone or side chain functionalities. For example, 
these types of alterations to the amino acid substituents 

25 and peptides can enhance the peptide's stability to 
enzymatic breakdown and increase biological activity. 
Modifications to the peptide backbone similarly can add 
stability and enhance activity. 

One skilled in the art, using the above sequences or 

30 formulae, can easily synthesize the peptides. Standard 
procedures for preparing synthetic peptides are well 
known in the art. The novel peptides can be synthesized 
using: the solid phase peptide synthesis (SPPS) method of 
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Merrifield (J. Am. Chem. Soc . , 85:2149 (1964)) or 
modifications of SPPS; or, the peptides can be 
synthesized using standard solution methods well known in 
the art (see, for example, Bodanzsky, ML, Principles of 
5 Peptide Synthesis, 2nd revised ed. , Springer-Verlag (1988 
and 1993)). Alternatively, simultaneous multiple peptide 
synthesis (SMPS) techniques well known in the art can be 
used. Peptides prepared by the method of Merrifield can 
be synthesized using an automated peptide synthesizer 

10 such as the Applied Biosystems 431A-01 Peptide 

Synthesizer (Mountain View, Calif.) or using the manual 
peptide synthesis technique described by Houghten, Proc. 
Natl. Acad. Sci., USA 82:5131 (1985). 

With these definitions in mind, the subject 

15 invention provides isolated nucleic acid molecules and 
proteins encoded by the isolated nuclei acid molecules. 
The proteins have transcriptional activation activity, 
i.e. they activate transcription. The nucleic acid 
molecules can be deoxyribonucleic acid (DNA) or 

2 0 ribonucleic acid (RNA, including messenger RNA or mRNA) , 

genomic or recombinant, biologically isolated or 
synthetic . 

The DNA molecules can be a cDNA molecule, which is a 
DNA copy of a messenger RNA (mRNA) encoding the protein. 
25 An example of the protein is the protein encoded by 

the nucleotide sequence as shown in SEQ ID NO: 1 (this is 
the open reading frame) . The amino acid sequence encoded 
by this nucleotide sequence is shown in SEQ ID NO:3. 
The full nucleotide sequence is as shown in SEQ ID NO:2. 

3 0 The amino acid sequence encoded by this nucleotide 

sequence is shown in SEQ ID NO: 4. 

The invention also provides an oligonucleotide that 
is complementary to at least a portion of the mRNA 
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encoding the protein. Oligonucleotides can be RNA or 
single-stranded DNA, and can be complementary to the 
entire mRNA molecule encoding the protein (i.e. of the 
same nucleotide length as the entire molecule) . It may 
5 be desirable, however, to work with a shorter molecule. 
In this instance, the oligonucleotide can be 
complementary to a portion of the entire mRNA molecule 
encoding the protein. These shorter oligonucleotides are 
capable of hybridizing to the mRNA encoding the entire 

10 molecule, and preferably consist of about twenty to about 
one hundred nucleotides. These oligonucleotides can be 
used to reduce levels of proteins having transcriptional 
activating activity, by introducing into cells an RNA or 
single-stranded DNA molecule that is complementary to at 

15 least a portion of the mRNA of the protein (i.e. by 

introducing the oligonucleotide) . The oligonucleotide 
can base -pair with the mRNA of the protein, preventing 
translation of the mRNA into protein. Thus, an 
oligonucleotide can prevent translation of mRNA encoding 

2 0 the protein into a functional protein. It may be 

desirable to place the oligonucleotide downstream and 
under the control of a particular promoter, so that the 
oligonucleotide will prevent translation of mRNA encoding 
the protein only in cells in which the particular 
25 promoter functions. 

More particularly, an oligonucleotide complementary 
to at least a portion of mRNA encoding a transcriptional 
activator protein can be used to decrease expression of a 
functional channel. A cell with a first level of 

3 0 expression of a functional transcriptional activator 

protein is selected, and then the oligonucleotide is 
introduced into the cell. The oligonucleotide blocks 
expression of functional transcriptional activator 
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protein, resulting in a second level of expression of a 
functional transcriptional activator protein in the cell. 
The second level is less than the initial first level. 

Oligonucleotides can be introduced into cells by any 
5 suitable means. In one embodiment, the oligonucleotide 
RNA molecule is injected directly into the cellular 
cytoplasm, where the RNA interferes with translation. A 
vector may also be used for introduction of the 
oligonucleotide into a cell. Such vectors include 
10 various plasmid and viral vectors. For a general 
discussion of oligonucleotides such as antisense 
molecules and their use, see Han et al . 1991 and Rossi 
1995 . 

The nucleic acid molecules of the subject invention 

15 can be expressed in suitable host cells using 

conventional techniques. Any suitable host and/or vector 
system can be used to express the transcriptional 
activator protein. 

Techniques for introducing the nucleic acid 

20 molecules into the host cells may involve the use of 
expression vectors which comprise the nucleic acid 
molecules. These expression vectors (such as plasmids 
and viruses; viruses including bacteriophage) can then be 
used to introduce the nucleic acid molecules into 

25 suitable host cells. For example, DNA encoding the 

transcriptional activator protein can be injected into 
the nucleus of a host cell or transformed into the host 
cell using a suitable vector, or mRNA encoding the 
transcriptional activator protein can be injected 

3 0 directly into the host cell, in order to obtain 

expression of the transcriptional activator protein in 
the host cell. 



Various methods are known in the art for introducing 
nucleic acid molecules into host cells. One method is 
microinjection, in which DNA is injected directly into 
the nucleus of cells through fine glass needles (or RNA 
5 is injected directly into the cytoplasm of cells) . 
Alternatively, DNA can be incubated with an inert 
carbohydrate polymer (dextran) to which a positively 
charged chemical group (DEAE, for diethylaminoethyl) has 
been coupled. The DNA sticks to the DEAE-dextran via its 

10 negatively charged phosphate groups. These large DNA- 
containing particles stick in turn to the surfaces of 
cells, which are thought to take them in by a process 
known as endocytosis. Some of the DNA evades destruction 
in the cytoplasm of the cell and escapes to the nucleus, 

15 where it can be transcribed into RNA like any other gene 
in the cell. In another method, cells efficiently take 
in DNA in the form of a precipitate with calcium 
phosphate. In electroporation, cells are placed in a 
solution containing DNA and subjected to a brief 

20 electrical pulse that causes holes to open transiently in 
their membranes. DNA enters through the holes directly 
into the cytoplasm, bypassing the endocytotic vesicles 
through which they pass in the DEAE-dextran and calcium 
phosphate procedures. DNA can also be incorporated into 

25 artificial lipid vesicles, liposomes, which fuse with the 
cell membrane, delivering their contents directly into 
the cytoplasm. In an even more direct approach, DNA is 
absorbed to the surface of tungsten micropro j ectiles and 
fired into cells with a device resembling a shotgun. 

3 0 Several of these methods, microinjection, 

electroporation, and liposome fusion, have been adapted 
to introduce proteins into cells. For review, see 
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Mannino and Gould-Fogerite 1988, Shigekawa and Dower 
1988, Capecchi 1980, and Klein et al . 1987. 

Further methods for introducing nucleic acid 
molecules into cells involve the use of viral vectors. 
5 One such virus widely used for protein production is an 
insect virus, baculovirus . For a review of baculovirus 
vectors, see Miller (1989) . Various viral vectors have 
also been used to transform mammalian cells, such as 
bacteriophage, vaccinia virus, adenovirus, and 

10 retrovirus. 

As indicated, some of these methods of transforming 
a cell require the use of an intermediate plasmid vector. 
U.S. Patent No. 4,237,224 to Cohen and Boyer describes 
the production of expression systems in the form of 

15 recombinant plasmids using restriction enzyme cleavage 
and ligation with DNA ligase. These recombinant plasmids 
are then introduced by means of transformation and 
replicated in unicellular cultures including procaryotic 
organisms and eucaryotic cells grown in tissue culture. 

2 0 The DNA sequences are cloned into the plasmid vector 

using standard cloning procedures known in the art, as 
described by Sambrook et al . (1989) . 

Host cells into which the nucleic acid encoding the 
transcriptional activator protein has been introduced can 
25 be used to produce the transcriptional activator protein. 

Having identified the nucleic acid molecules 
encoding transcriptional activator proteins and methods 
for expressing the transcriptional activator proteins 
encoded thereby, the invention further provides methods 

3 0 of screening a substance (for example, a compound or 

inhibitor) for the ability of the substance to modify 
transcriptional activator protein function. In one 
embodiment, the method comprises introducing a nucleic 
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acid molecule encoding the transcriptional activator 
protein into a host cell, and expressing the 
transcriptional activator protein encoded by the molecule 
in the host cell. The cell is then exposed to a 
5 substance and evaluated to determine if the substance 
modifies the function of the transcriptional activator 
protein. In another embodiment, an isolated 
transcriptional activator protein is exposed to the 
substance for evaluation of whether the substance 

10 modifies the function of the transcriptional activator 

protein. From these evaluations, substances effective in 
altering the function of the transcriptional activator 
protein can be found. Such agents may be agonists or 
antagonists, with antagonists being preferred herein. 

15 The evaluation of a cell to determine if the 

substance modifies the function of the transcriptional 
activator protein can be by any means known in the art. 
The evaluation can comprise the direct monitoring of 
expression of transcriptional activator protein in the 

2 0 host cell, or the evaluation can be indirect. 

The nucleic acid molecules of the subject invention 
can be used either as probes or for the design of primers 
to obtain DNA encoding other transcriptional activator 
proteins by either cloning and colony/plaque 
25 hybridization or amplification using the polymerase chain 
reaction (PCR) . 

Specific probes derived from SEQ ID NO:l can be 
employed to identify colonies or plaques containing 
cloned DNA encoding a member of the transcriptional 

3 0 activator protein family using known methods (see 

Sambrook et al . 1989) . One skilled in the art will 
recognize that by employing such probes under high 
stringency conditions (for example, hybridization at 42 °C 



with 5X SSPC and 50% formamide, washing at 50-65°C with 
0.5X SSPC), sequences having regions which are greater 
than 90% homologous or identical to the probe can be 
obtained. Sequences with lower percent homology or 
5 identity to the probe, which also encode transcriptional 
activator proteins, can be obtained by lowering the 
stringency of hybridization and washing (e.g., by 
reducing the hybridization and wash temperatures or 
reducing the amount of formamide employed) . 

10 More particularly, in one embodiment, the method 

comprises selection of a DNA molecule encoding a 
transcriptional activator protein, or a fragment thereof, 
the DNA molecule having a nucleotide sequence as shown in 
SEQ id NO:l, and designing an oligonucleotide probe for 

15 transcriptional activator protein based on the nucleotide 
sequence of the selected DNA molecule. A genomic or cDNA 
library of an organism is then probed with the 
oligonucleotide probe, and clones are obtained from the 
library that are recognized by the oligonucleotide probe 

2 0 so as to obtain DNA encoding another transcriptional 

activator protein. 

Specific primers derived from SEQ ID N0:1 can be 
used in PCR to amplify a DNA sequence encoding a member 
of the transcriptional activator protein family using 
25 known methods (see Innis et al . 1990) . One skilled in 
the art will recognize that by employing such primers 
under high stringency conditions (for example, annealing 
at 50-60°C, depending on the length and specific 
nucleotide content of the primers employed) , sequences 

3 0 having regions greater than 75% homologous or identical 

to the primers will be amplified. 

More particularly, in a further embodiment the 
method comprises selection of a DNA molecule encoding 
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transcriptional activator protein, or a fragment thereof, 
the DNA molecule having a nucleotide sequence as shown in 
SEQ ID N0:1, designing degenerate oligonucleotide primers 
based on the nucleotide sequence of the selected DNA 
5 molecule, and employing such primers in the polymerase 
chain reaction using as a template a DNA sample to be 
screened for the presence of transcriptional activator 
protein-encoding sequences. The resulting PCR products 
can be isolated and sequenced to identify DNA fragments 

10 that encode polypeptide sequences corresponding to the 
targeted region of transcriptional activator protein. 

Various modifications of the nucleic acid and amino 
acid sequences disclosed herein are covered by the 
subject invention. These varied sequences still encode a 

15 functional transcriptional activator protein. The 

invention thus further provides an isolated nucleic acid 
molecule encoding a transcriptional activator protein, 
the nucleic acid molecule encoding a first amino acid 
sequence having at least 90% amino acid identity to a 

2 0 second amino acid sequence, the second amino acid 

sequence as shown in SEQ ID NO: 3. In further 
embodiments, the first amino acid sequence has at least 
95%, 96%, 97%, 98%, or 99% amino acid identity to SEQ ID 
NO: 3 . 

25 The invention further provides an isolated DNA 

oligomer capable of hybridizing to the nucleic acid 
molecule encoding the transcriptional activator protein 
according to the subject invention. Such oligomers can 
be used as probes in a method of detecting the presence 

3 0 of transcriptional activator protein in a sample. More 

particularly, a sample can be contacted with the DNA 
oligomer and the DNA oligomer will hybridize to any 
transcriptional activator protein present in the sample, 
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forming a complex therewith. The complex can then be 
detected, thereby detecting presence of transcriptional 
activator protein in the sample. 

The complex can be detected using methods known in 
5 the art. Preferably, the DNA oligomer is labeled with a 
detectable marker so that detection of the marker after 
the DNA oligomer hybridizes to any transcriptional 
activator protein in the sample (wherein non- hybridized 
DMA oligomer has been washed away) is detection of the 

10 complex. Detection of the complex indicates the presence 
of transcriptional activator protein in the sample. As 
will be readily apparent to those skilled in the art, 
such a method could also be used quantitatively to assess 
the amount of transcriptional activator protein in a 

15 sample. 

For detection, the oligomers can be labeled with, 
for example, a radioactive isotope, biotin, an element 
opaque to X-rays, or a paramagnetic ion. Radioactive 
isotopes are commonly used and are well known to those 

20 skilled in the art. Representative examples include 

indium-Ill, technetium- 9 9m, and iodine-123 . Biotin is a 
standard label which would allow detection of the biotin 
labeled oligomer with avidin. Paramagnetic ions are also 
commonly used and include, for example, chelated metal 

25 ions of chromium (III) , manganese (II) , and iron (III) . 
When using such labels, the labeled DNA oligomer can be 
imaged using methods known to those skilled in the art. 
Such imaging methods include, but are not limited to, X- 
ray, CAT scan, PET scan, NMRI , and fluoroscopy. Other 

30 suitable labels include enzymatic labels (horseradish 
peroxidase, alkaline phosphatase, etc.) and fluorescent 
labels (such as FITC or rhodamine, etc.). 
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The invention further provides an isolated 
transcriptional activator protein. The protein is 
preferably encoded by a nucleotide sequence as shown in 
SEQ ID N0:1. The protein preferably has an amino acid 
5 sequence as shown in SEQ ID NO: 3. Further provided is an 
isolated transcriptional activator protein encoded by a 
first amino acid sequence having at least 9 0% amino acid 
identity to a second amino acid sequence, the second 
amino acid sequence as shown in SEQ ID NO: 3. In further 
10 embodiments, the first amino acid sequence has at least 
95%, 96%, 97%, 98%, or 99% amino acid identity to SEQ ID 
NO: 3 . 

The invention further provides an antibody or 
fragment thereof specific for the transcriptional 
15 activator protein of the subject invention. Antibodies 
of the subject invention include polyclonal antibodies 
and monoclonal antibodies capable of binding to the 
transcriptional activator protein, as well as fragments 
of these antibodies, and humanized forms. Humanized 

2 0 forms of the antibodies of the subject invention may be 

generated using one of the procedures known in the art 
such as chimerization. Fragments of the antibodies of 
the present invention include, but are not limited to, 
the Fab, the F(ab') 2 , and the Fc fragments. 
25 The invention also provides hybridomas which are 

capable of producing the above -described antibodies. A 
hybridoma is an immortalized cell line which is capable 
of secreting a specific monoclonal antibody. 

In general, techniques for preparing polyclonal and 

3 0 monoclonal antibodies as well as hybridomas capable of 

producing the desired antibody are well known in the art 
(see Campbell 1984 and St. Groth et al . 1980). Any 
animal (mouse, rabbit, etc.) which is known to produce 
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antibodies can be immunized with the antigenic 
transcriptional activator protein (or an antigenic 
fragment thereof) . Methods for immunization are well 
known in the art. Such methods include subcutaneous or 
5 intraperitoneal injection of the protein. One skilled in 
the art will recognize that the amount of the protein 
used for immunization will vary based on the animal which 
is immunized, the antigenicity of the protein, and the 
site of injection. 

10 The protein which is used as an immunogen may be 

modified or administered in an adjuvant in order to 
increase the protein's antigenicity. Methods of 
increasing the antigenicity of a protein are well known 
in the art and include, but are not limited to, coupling 

15 the antigen with a heterologous protein (such as a 

globulin or beta-galactosidase) or through the inclusion 
of an adjuvant during immunization. 

For monoclonal antibodies, spleen cells from the 
immunized animals are removed, fused with myeloma cells, 

2 0 such as SP2/0-Ag 15 myeloma cells, and allowed to become 
monoclonal antibody producing hybridoma cells. 

Any one of a number of methods well known in the art 
can be used to identify the hybridoma cell which produces 
an antibody with the desired characteristics. These 

2 5 include screening the hybridomas with an ELISA assay, 

western blot analysis, or radioimmunoassay (Lutz et al . 
1988) . 

Hybridomas secreting the desired antibodies are 
cloned and the class and subclass are determined using 

3 0 procedures known in the art (Campbell 1984) . 

For polyclonal antibodies, antibody containing 
antisera is isolated from the immunized animal and is 
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screened for the presence of antibodies with the desired 
specificity using one of the above-described procedures. 

In accordance with the above discussion, the subject 
invention further provides a method of producing an 
5 antibody specific for a transcriptional activator protein 
in a host. The method comprises selecting the isolated 
transcriptional activator protein or an antigenic portion 
thereof and introducing the selected transcriptional 
activator protein or antigenic portion thereof into a 

10 host to induce production of an antibody specific for 
transcriptional activator protein in the host. 

The present invention further provides the above- 
described antibodies in detectably labeled form. 
Antibodies can be detectably labeled through the use of 

15 radioisotopes, affinity labels (such as biotin, avidin, 
etc.), enzymatic labels (such as horseradish peroxidase, 
alkaline phosphatase, etc.), fluorescent labels (such as 
FITC or rhodamine, etc.), paramagnetic atoms, etc. 
Procedures for accomplishing such labeling are well known 

20 in the art, for example see Sternberger et al . 1970, 

Bayer et al . 1979, Engval et al . 1972, and Goding 1976. 

The labeled antibodies or fragments thereof of the 
present invention can be used for in vitro, in vivo, and 
in situ assays to identify cells or tissues which express 

25 transcriptional activator protein, to identify samples 
containing transcriptional activator protein, or to 
detect the presence of transcriptional activator protein 
in a sample. More particularly, the antibodies or 
fragments thereof can thus be used to detect the presence 

3 0 of transcriptional activator protein in a sample, by 
contacting the sample with the antibody or fragment 
thereof. The antibody or fragment thereof binds to any 
transcriptional activator protein present in the sample, 
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forming a complex therewith. The complex can then be 
detected, thereby detecting the presence of 
transcriptional activator protein in the sample. As will 
be readily apparent to those skilled in the art, such a 
5 method could also be used quantitatively to assess the 
amount of transcriptional activator protein in a sample. 
As should also be readily apparent, such an antibody may 
also be used to decrease levels of functional 
transcriptional activator protein, by blocking the 
10 protein. 

EXAMPLE 

A subtractive hybridization protocol was used to 
15 identify novel expressed sequence tags (ESTs) 

corresponding to mRNAs whose expression was induced upon 
exposure of the human leukemia cell line K562 to the 
phorbol ester 12 -O-tetradecanolyphorbol - 13 -acetate (TPA) . 

The complete open reading frame of one of the novel 

2 0 ESTs, named TIG-1, was obtained by screening K562 cell 

and placental cDNA libraries. The deduced open reading 
frame of the TIG-1 cDNA encodes for a glutamine repeat 
rich protein with a predicted molecular weight of 63 
kDa. The predicted open reading frame also contains a 
25 consensus bipartite nuclear localization signal, though 
no specific DNA binding domain was found. 

The corresponding TIG-1 mRNA is ubiquitously 
expressed. Placental tissue expresses the TIG-1 mRNA 200 
fold times more than the lowest expressing tissues such 

3 0 as kidney and lung. There is also preferential TIG-1 

mRNA expression in cells of bone marrow lineage. 

Tn vitro transcription/translation of the TIG-1 
cDNA yielded a polypeptide with an apparent molecular 



- 28 - 

weight of 97 kDa. Using polyclonal antibodies obtained 
from a rabbit immunized with the carboxy terminal portion 
of bacterially expressed TIG-1 protein, a polypeptide 
with molecular weight of 97 kDa was identified by Western 
5 blot analyses of protein lysates obtained from K562 
cells . 

Cotransf ection assays of K562 cells, using a 
GAL4-TIG-1 fusion gene and GAL 4 operator-CAT, indicate 
that the TIG-1 protein may have transcriptional 

10 regulatory activity when tethered to DNA. It was 
hypothesized that this novel glutamine rich protein 
participates in a protein complex that regulates gene 
transcription. It has been demonstrated by Naar et . al . , 
1999 that the amino acid sequences of peptide fragments 

15 obtained from a polypeptide found in a complex of 
proteins that alters chromatin structure (ARC) are 
identical to portions of the deduced open reading frame 
of TIG-1 mRNA. 

2 0 Introduction 

The K562 human leukemia cell line can be made to 
differentiate to a variety of myeloid derived lineages. 
Induction with hemin leads to expression of markers of 
erythroid differentiation, while exposure to the phorbol 
25 ester 12-0-tetradecanolyphorbol-13 -acetate (TPA) leads to 
extinction of the erythroid phenotype and induction of a 
megakaryocyte phenotype. Previous work in many 
laboratories, has identified a set of erythroid specific 
genes, such as the globins and glycophorin A, whose 

3 0 expression was increased upon hemin induction of K562 

cells (Andersson et . al . , 1979, Charnay and Maniatis, 
1983, Dean et . al . , 1983). Gene expression during TPA 
induced megakaryocytic differentiation of K562 cells has 
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been less well studied (Alitalo et . al . 1988, Alitalo et . 
al. 1990, Lin et al . 1994, Lumelsky and Forget, 1991). 

Since the K562 cell line can serve as a model 
for pluripotent bone marrow stem cell differentiation; it 
5 is of interest to identify novel genes preferentially 
expressed in myeloid lineage cells. Subtractive 
hybridization protocol, first introduced by Wang and 
Brown, was used to identify cDNAs corresponding to novel 
mRNAs whose expression was increased after K5 62 cell 

10 induction with TPA. 

The molecular cloning, expression, and functional 
characterization of TIG-1, a novel human CAG repeat rich 
cDNA, whose corresponding mRNA contains a deduced open 
reading frame encoding for a glutamine repeat rich 

15 protein with a consensus bipartite nuclear localization 
signal is reported. The 3 . 7 kb mRNA encoding for this 
protein, though constitutively expressed, is highly 
preferentially expressed in placental and bone marrow 
lineage cells. Using rabbit polyclonal antibodies raised 

2 0 to the bacterially expressed carboxy terminal portion of 

the deduced open reading frame of TIG-1, Western blot 
analyses of K562 cell protein extracts show that TIG-1 is 
found in the cytoplasm and nucleus with an apparent 
molecular weight of 97 kDa . 
25 In transient transfection assays of K562 cells 

transfected with a GAL 4 : TIG-1 fusion gene in an 
eukaryotic expression vector, evidence is presented that 
the protein encoded by this novel cDNA has 
transcriptional regulatory activity when tethered to DNA 

3 0 by the Gal 4 DNA binding domain. 

Material and Methods 

Cell Culture: 
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K562 cells were grown in RPMI 164 0 medium 
supplemented with 10% heat inactivated fetal bovine serum 
and incubated at 37° C in the presence of 5% C0 2 . Cells 
were induced with 20 fiM hemin for 4 8 hours or 34 nM TPA 
5 for 72 hours . 

Construction of expressed sequence tag library by- 
subtraction of hemin induced cDNAs from TPA induced cDNAs 
derived from mRNAs obtained from K562 cells: 

10 Poly A + mRNAs were isolated from K562 cells induced 

with either 2 0 hemin for 72 hours or 34 nM TPA for 48 
hours. The corresponding cDNAs were synthesized using a 
cDNA construction kit (Amersham, Chicago, IL) . 

The protocol elaborated by Wang and Brown 1991 with 

15 some modification. Long cDNAs were digested with the 

restriction endonuclease Rsa 1 and then ligated to linker 
DNAs (5 ' GAATTCAGATCTCCCGGGTCACCGC3 ' and 

5 ' TGACCCGGGAGATCGAATTC3 ' ) . Linkered cDNA fragments were 
amplified by PCR. PCR amplified cDNA fragments 

2 0 constructed from the TPA induced mRNAs were used as 

"tracer" cDNAs, while a five fold molar excess of 
biotinylated PCR amplified cDNA fragments constructed 
from hemin induced mRNAs were used as "driver" cDNAs to 
produce a EST library that was highly enriched in cDNA 
25 fragments generated from the TPA induced K562 cells. 

The subtractive hybridization reaction was performed 
as follows: 

10 ,ug of cDNA fragments produced from hemin induced 
K562 mRNAs were mixed with 10 fig of photoprobe biotin™ 

3 0 (Vector Laboratories, Burlingame, CA ) kept on ice and 

was exposed to a 270 Watt sunlamp kept 10 cm above the 
mixture for 15 minutes. The biotinylation procedure was 
performed a second time. The reaction was then stopped 
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by neutralization with an equal volume of 0.1 M Tris HCl 
(pH 9.0) and the final volume was adjusted to 10 0 /zl with 
distilled water. 

Unreacted biotin species were then extracted with 
5 the addition of an equal volume of 2-Butanol and the 
extraction was repeated twice. The aqueous phase 
containing the biotinylated DNA was precipitated with 
1/10 volume of 1 M NaCl and 2 Volumes of ethanol . The 
biotinylated cDNA pool was resuspended in 10 jul of 

10 Tris : EDTA (pH 7.6). 

5 /ug of biotinylated cDNA "driver" fragments 
produced from hemin induced mRNAs were mixed with 1 ,ag of 
"tracer" cDNA fragments produced from TPA induced K562 
mRNAs and precipitated with ethanol; then resuspended in 

15 20 Ml of TE pH 8.0. This mixture was denatured by 
boiling in a water bath for 3 minutes, centrifuged 
briefly and then an equal volume of 2x hybridization 
buffer (50 mM HEPES (pH 7.5), 2 mM EDTA, 1.5 M NaCl) was 
added and overlaid with mineral oil. The hybridization 

2 0 mixture was denatured again in a boiling water bath for 3 

minutes and then hybridized for 2 hours at 63° C. 

The hybridization reaction was stopped with the 
addition of 160 piL of HE buffer solution (10 mM HEPES, 
ImM EDTA pH 7.6 prewarmed to 55° C) and the aqueous phase 
25 transferred to an eppendorf tube prewarmed at 55° C for 
five minutes. The biotinylated cDNAs were complexed by 
the addition of 5//L of streptavidin solution (10 mg/ml of 
streptavidin in 0.15 M NaCl, 10 mM HEPES pH 7.6, ImM 
EDTA) and the reaction mix was incubated at room 

3 0 temperature for 2 0 minutes. The streptavidin: biotin 

complex was extracted with equal volumes of 

phenol : chloroform (1:1) prewarmed to 3 7° C. 

Phenol : chloroform extractions were repeated until no 
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visible DNA complex was noted at interphase. Then the 
aqueous phase containing the remaining subtracted cDNAs 
was extracted with chloroform, ethanol precipitated and 
resuspended in 2 0 /iL of TE (pH 8.0) . 
5 The entire procedure described above was repeated 

using the subtracted cDNAs obtained from the short 
hybridization reaction as "tracer" , with the remaining 
biotinylated driver cDNA fragments; but this time the 
hybridization mixture was hybridized for 24 hours at 63° 
10 C. 

The subtracted cDNAs enriched for cDNAs derived from 
TPA induced K562 mRNAs were then amplified by PCR for 
thirty cycles (94° C 1 minute, 50° C 1 minute, 72° C 3 
minutes) . The resultant amplified enriched cDNA pool was 

15 then digested with the restriction endonuclease EcoRl, 
ligated into the plasmid Bluescript pBS KS 11+ , and 
transformed into XL-blue 1 cells by electroporation. 
Transformed cells were plated to produce the expressed 
sequence tag library enriched for inserts derived from 

20 TPA induced K562 cell mRNAs. 

Expressed sequence tag library sequence analysis 

Expressed sequence tag inserts in the plasmid pBS 
KS+ were flanked by the T3 and T7 promoters. Inserts 
25 were directly sequenced using T3 or T7 sequencing 

primers. Sequence analysis of the inserts was performed 
on GenBank, EMBL, PIR, and Swissprot databases using 
FASTA and BLAST algorithms . 

3 0 Screening of cDNA libraries 

The 193 bp EST fragment from clone pSA25 was used as 
a probe to screen K562 or placental cell 1 GT11 cDNA 
libraries. Using standard screening techniques we 
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identified a K562 cell clone that contained a 1.5 kb 
insert which contained the 193 bp sequence of the 
screening probe. This 1.5 kb cDNA fragment was used to 
screen the placental phage clones and identified 
5 individual clones containing 2.1 and 2.5 kb of cDNA 
sequence. DNA sequences of these partial cDNAs were 
obtained by manual and automated sequencing. 

Northern Blots: 

10 Total RNA was isolated from uninduced, hemin, or TPA 

induced K562 cells with TRI reagent (Molecular Research 
Center, Inc., Cincinnati, OH) according to the 
manufacturer's protocol. Twenty peg of total RNA was size 
fractionated in a 1.5% formaldehyde agarose gel. The RNA 

15 was transferred onto a nylon membrane overnight by 

capillary action. The nylon membrane was washed briefly 
in 6X SSC and then UV cross- linked . Membranes were 
stored at 4° C. Multiple organ and immune tissue blots 
were purchased from Clontech laboratories (Palo Alto, 

2 0 CA) . 

All blots were probed with the 193 bp EST fragment 
of TIG-1. Expression of TIG-1 mRNA in K562 cell Northern 
blots was normalized to expression of G3PDH mRNA; for 
blots obtained from Clontech, expression of TIG-1 was 
25 normalized to expression of E-actin mRNA. 

In vitro transcription and translation 

A plasmid containing 3.2 kb of the TIG-1 cDNA was 
linearized by restriction enzyme digestion and then 

3 0 transcribed with either T7 or T3 RNA polymerase to 

generate sense or antisense transcripts. Transcripts 
produced in vitro were analyzed on a 1.5% formaldehyde 
agarose gel . 
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A rabbit reticulocyte lysate obtained from Promega 
(Madison, WT) was used for the in vitro translation 
reaction. In vitro translation products were analyzed 
on a 10% SDS-polyacrylamide gel that was stained with 
5 Coomassie Blue. 

Production of antibodies to the carboxy terminal portion 
of the TIG-1 protein 

PCR primers were used to amplify the coding region 

10 of TIG-1 protein from amino acids 394 to 579. This 

fragment was cloned in frame 3' to the coding sequence 
for glutathione S-transf erase (GST) . The resultant 
plasmid was transformed into E. coli which were grown and 
induced with 0.5 mM IPTG to overexpress the GST:TIG-1 

15 fusion protein. This protein was affinity purified with 
glutathione agarose beads following the manufacturers 
directions. Confirmation that a GST fusion protein was 
obtained by performing a Western Blot using a monoclonal 
antibody directed to the GST portion of the fusion 

2 0 protein. A fusion protein of the correct expected 

molecular weight was detected. 

The GST: TIG-1 fusion protein was used to immunize a 
rabbit (CoCalico Biologies, Reamstown, PA) . Antiserum 
was obtained six weeks, 12, and 18 weeks after the first 
25 injection of the protein. 

Affinity purification of TIG-1 antiserum 

One ml of post immune serum was incubated with GST 
protein at 4° C overnight . The reaction was then 

3 0 centrifuged at 3 0 00 rpm for 5 minutes and the supernatant 

was transferred to a fresh tube. This supernatant was 
incubated at 4° C for 12 hours with a nitrocellulose 
membrane which contained the 4 6 kDa GST: TIG-1 fusion 
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protein transferred to the nitrocellulose membrane 
following SDS-PAGE electrophoresis. The membrane was 
washed in PBS at room temperature for 15' repeated x 3; 
then affinity purified antibody was eluted from the 
5 membrane by addition of elution buffer (0.1 M glycine, 
0.1 M NaCl pH 2.5) and shaking at room temperature for 
20'. The eluate was transferred to a fresh tube and 
neutralized with 1 M Tris pH 8.8 to a pH of 7.5. Bovine 
serum albumin (BSA) was added to a final concentration of 
10 5 mg/ml. The affinity purified antibodies were stored at 
4° C till further use. 

Western Blot analyses of K562 cytoplasmic and nuclear 
extracts with affinity purified antibodies: 

15 Nuclear extracts were made from uninduced, hemin, 

and TPA induced K562 cells using the Dignam protocol 
(Dignam et . al. 1983). Cytoplasmic extracts were 
obtained following Dounce homogenization and 
centrifugation to produce the nuclear pellet. 

2 0 Cytoplasmic and Dignam nuclear extracts were stored at 
-70° C till further use. 

50 /ig aliquots of protein from the cytoplasmic or 
nuclear extracts were size fractionated on a 10% SDS-PAGE 
gel and transferred to a nitrocellulose membrane. 

2 5 Membranes were incubated for 1 hour at room temperature 

with either a 1:100 dilution of preimmune serum or 1:5 
dilutions of affinity purified pre- or post -immune 
serum. The membrane was washed and incubated with goat 
antirabbit antibody followed by a chemi-luminscent 

3 0 reaction used to identify protein bands that were 

specifically recognized by post -immune serum. 

Transient trans feet ion experiments 
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5 x 10 7 cells were used for each transf ection. All 
transf ections contained 5 fig of the reporter plasmid 
pGALx50P E1B CAT, 2 jug of pCMVE gal (to assay for 
transfection efficiency) , and a test plasmid in either a 
5 1:1 or 1:5 molar ratio with the reporter plasmid. 

Plasmid DNAs were transiently transfected into K562 cells 
via electroporation at 25 0V and 960 /uF with a Bio-Rad 
(Hercules, CA) Gene pulser. After resting on ice for 30 
minutes, each electroporation reaction was resuspended in 

10 10 ml of fresh RPMI 1640 medium and incubated for 12 

hours at 3 7° C. Then each dish of transfected cells was 
split into three equal volumes and replated using either 
TPA or hemin as an inducing agent . 

Forty eight hours after transient transfection whole 

15 cell freeze-thaw lysates were prepared and stored till 
further use at -2 0° C. Protein concentration of lysates 
were measured with the Bradford Reagent. Equal amounts 
of protein lysates from each dish of transfected cells 
was used to measure S-gal and CAT protein activities via 

2 0 standard techniques. 

Results and Discussion 

Identification of expressed sequence tag fragments 
25 corresponding to TPA induced mRNAs 

Following two rounds of subtractive hybridization, 
the recombinant cDNA clones were isolated and their 
sequences searched for possible homology to genes in the 
NIH database. To confirm that the subtractive 

3 0 hybridization yielded cDNA clones corresponding to mRNAs 

whose expression increased following TPA induction of 
K562 cells, the expression of three randomly picked 
clones: SA11, SA21, and SA25 was analyzed by Northern 



- 37 - 

blot. The expression of mRNAs hybridizing to each of 
these clones was found to be upregulated following 72 
hours of TPA induction of K562 cells as shown in Figure 
1A. These three independent cDNA fragments hybridized to 
5 0.8 kb, 0.9 kb, and 3.7 kb mRNA transcripts, 
respectively. 

The cDNA fragment (SA21) was found to be identical 
to a partial sequence for the tissue inhibitor of 
metalloproteinases gene-1 (Alitalo et . al . 1990) which 

10 showed (in comparison to hemin- induced mRNA levels) a 
30 -fold TPA induced mRNA increase (Figure IB: 
SA21/TIMP-1) . The sequence of the cDNA SA11 
corresponding to a 0.8 kb mRNA was found to be identical 
to an EST in the database of unknown function. This 0.8 

15 kb mRNA demonstrated (in comparison to hemin induced 
levels) a 1.5 fold TPA induced increase. 

The 193 bp cDNA fragment SA25, which hybridized to a 
3.7 kb mRNA, was not identical to any sequence in the NIH 
database. This novel cDNA was named TPA inducible gene 

20 1 (TIG-1) . The 3.7 kb TIG-1 mRNA demonstrated (in 
comparison to hemin induced levels) a 2.5 fold TPA 
induced increase (Figure IB: SA25/TIG-1) . On the basis 
of this Northern blot analysis we believe that the 
subtractive library represents genes expressed during TPA 

25 induction of K562 cells. 

3.2. Time course of TPA induction of K562 cells on 
expression of TIG-1 mRNA: 

To determine whether there was a change in the 
steady state level of TIG-1 mRNA with shorter periods of 

30 TPA induction, total cellular RNA isolated from K562 

cells was analyzed at 1 hr, 4 hrs, 12 hrs, 24 hrs, 48 hrs 
and 72 hrs after induction with TPA. Northern blots were 
probed with a TIG-1 cDNA fragment. The steady state 
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level of TIG-1 mRNA was found to vary, with a 3.5-4 fold 
induction at 12 hrs in TPA induced cells in comparison to 
uninduced cells. The expression then declined to a 
2-2.5-fold induction over 72 hrs. An increase in the 
5 steady state level of TIG-1 mRNA occurs between 4-12 hrs 
following TPA induction (Figure 2A and 2B) . Whether 
there is a cell cycle stage dependency on the level of 
TIG-1 mRNA expression remains to be determined. The K562 
cells were not synchronized prior to induction with TPA. 

10 

Expression of TIG-1 mRNA in various tissues 
Northern blot analyses was used to determine whether the 
mRNA encoding TIG-1 exhibited any tissue restricted 
pattern of expression. Screening was used to detect the 

15 presence of human TIG-1 mRNA in hematopoietic and 

non-hematopoietic tissues. Using the 193 bp TIG-1 cDNA 
fragment as a probe, a 3.7 kb transcript was detected in 
all tissues examined, as shown in Figure 3A. The wide 
range of tissues expressing TIG-1 mRNA suggests that the 

20 protein encoded by this mRNA serves a constitutive 

function in the cell. However, the level of TIG-1 mRNA 
expression varied; with the highest amounts of expression 
in the placenta and peripheral blood leukocytes, and 
lowest in the pancreas and the kidney (Figure 3B) . In 

25 comparison to the kidney, there was 250 times more 

expression of TIG-1 mRNA in the placenta and fifty times 
more expression in bone marrow derived tissues (Figure 
3B) . Whether there is the same wide variability in the 
level of TIG-1 protein expression between tissues remains 

3 0 to be determined. 



Determination of the full length cDNA sequence of TIG-1 
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Full length K562 cell and placental cDNA libraries 
were screened in order to obtain approximately 3.4 kb of 
nucleotide sequence. This contig contained 163 bp of 5' 
untranslated region, a 1,737 bp open reading frame 
5 encoding for 579 amino acids, and 1,499 bp of 3' 
untranslated region. 

The proximal ATG codon begins at 164 bp from the 
start base pair of the 3.4 kb contig, and is preceded by 
the Kozak sequence GCCATGA in an adequate context, 

10 predicting a continuous open reading frame (ORF) of 1737 
bp as shown in Figure 4. A putative polyadenylat ion 
signal AATAAA is located 16 bp upstream of the poly A 
tail and is underlined. The complete predicted ORF of 
the TIG-1 cDNA sequence encodes a protein of 57 9 amino 

15 acids with a predicted molecular weight of 63 kDa . The 
deduced protein has an isoelectric pH of 10.0 and is rich 
in glutamine (28%) , serine (10%) and proline residues 
(17%) . The deduced amino acid sequence shows seven 
glutamine rich repeat regions containing homopolymeric 

20 repeats of 6-16 glutamine residues, as shown in Figure 5. 
A putative bipartite nuclear localization signal was 
identified at amino acid residues 515-525 (Figure 5.). 

While the full length sequence of TIG-1 cDNA was 
being determined, a partial cDNA fragment CTG7a was found 

25 in the database which was identical to the TIG-1 cDNA 

corresponding to amino acids 160-525 of the deduced open 
reading frame. This partial cDNA fragment was obtained 
by Ross et. al . 1997 following a screen of a brain cDNA 
library with a CAG repeat oligonucleotide in hopes of 

3 0 identifying novel genes involved in neurological 

disorders that are subject to trinucleotide expansion 
repeat mutations. Expression of TIG-1 mRNA in brain 
tissue is demonstrated in Figure 3A. 
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In vitro transcription and translation of TIG-1 cDNA 
To determine whether the TIG-1 cDNA clone could 
produce a mRNA transcript that yielded a polypeptide of 
the predicted 63 kDa size, in vitro transcription and 
5 translation of TIG-1 cDNA was carried out. Linearized 
TIG-1 cDNA was transcribed by T3 RNA polymerase or T7 RNA 
polymerase, to generate antisense or sense transcripts 
respectively. These transcripts were then translated in 
vitro by a rabbit reticulocyte lysate system. The 35 S 

10 methionine labeled translated products were fractionated 
on a 10% SDS-PAGE gel. Translation of the sense 
transcripts produced a polypeptide of approximately 97 
kDa and smaller polypeptides as seen in Figure 6A, lane 
4. It is possible that the smaller polypeptides arise 

15 from incomplete translational products or are due to 

proteolytic degradation. Antisense transcripts failed to 
produce any polypeptide (Figure 6A, lane 5) . No 
polypeptide bands were detected following in vitro 
translation of mRNA transcripts produced from a DNA 

20 template of the Bluescript plasmid ( Figure 6A, lane 6) 
or with a TIG-1 cDNA template (Figure 6A, lane 1) . The 
control, p45J\TF-E2 sense transcripts, yielded a 45 kDa 
polypeptide (Figure 6A, lane 2) while no translation 
products were detected with p45 NF-E2 antisense 

25 transcripts (Figure 6A, lane 3) . 

Detection of TIG-1 protein in K562 cells: 

To detect TIG-1 protein in K562 cells, an 
immuno- affinity purified rabbit polyclonal antibody was 
3 0 used to the carboxy terminal portion of bacterially 
expressed TIG-1 protein in Western Blot analyses of 
cytosolic and nuclear extracts obtained from K562 cells. 
The affinity purified post-immune serum detected a 
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protein of 97 kDa in the nuclear and cytoplasmic extracts 
of uninduced K562 cells, as shown in Figure 6B, lanes 5 
and lanes 6 . No protein bands were detected with 
pre-immune serum (Figure 6B, lanes 1, 2, 3, and 4). 
5 Similarly, the 97 kDa TIG-1 protein was detected in 

TPA- induced K562 cells with affinity-purified post immune 
serum (data not shown) . 

Western blot analyses of K562 protein lysates using 
affinity-purified antiserum preincubated with either 

10 bacterially expressed TIG-1 carboxy terminal peptide or 
bovine serum albumin confirmed the specificity of the 
antiserum. The 97 kDa TIG-1 protein band could not be 
detected by the affinity purified TIG-1 antiserum when it 
was preincubated with the TIG-1 carboxy terminal peptide ; 

15 while the 97 kDa polypeptide band was observed when 

bovine serum albumin was preincubated with the affinity 
purified TIG-1 antiserum (data not shown) . 

Though the deduced open reading frame of the TIG-1 
cDNA predicts a polypeptide of MW 63 kDa both the in 

2 0 vitro transcription and translation data shown in Figure 
6A, and the Western Blot analysis shown in Figure 6B 
demonstrate a protein product of MW 97 kDa. It is 
possible that the difference between the predicted and 
the observed MW of the TIG-1 protein is due to anomalous 

25 migration of a glutamine rich protein as observed by Cox 
et . al . 1996, or due to the high positive charge of the 
TIG-1 protein with a predicted pi of 10.0. 

CAT assay for detection of possible transcription 
30 regulatory activity of GAL4-TIG-1 fusion protein 

Though the deduced ORF of TIG-1 showed a putative 
consensus nuclear localization signal, no putative DNA 
binding domain was found. To determine whether the TIG-1 



- 42 - 



protein had any possible effect on activation of 
transcription cotransf ection assays using a cat reporter 
vector and an expression vector encoding for the full 
length TIG-1 protein fused 3 minutes to DNA binding 
5 domain of GAL4 protein (GAL4 -TIG-1) . The cat reporter 
construct contained five GAL4 binding sites 5' to the 
minimal promoter for the adenovirus E1B gene, 
pGAL4 5 E IB CAT (Figure 7A) . Expression of the GAL4 -TIG-1 
fusion plasmid would result in the tethering of TIG-1 

10 protein to DNA via the DNA binding domain of GAL 4. 

The GAL4-TIG-1 expression plasmid cotransf ected in 
the molar concentration of 1:5 to the CAT reporter 
plasmid showed a 3 -fold increase in CAT expression in 
uninduced K562 cells, when compared to the CAT activity 

15 of the Gal4 DNA binding domain taken as 1 (data not 

shown) . This suggests that TIG-1 protein could behave as 
a transcriptional activator. It was asked whether the 
glutamine rich domain of TIG-1 protein was sufficient for 
transcriptional activation (Gerber et . al . 1994). The 

20 construct GAL4-GLU which encodes for the glutamine rich 
portion of TIG-1 did not show any transact ivation 
activity (Figure 7B and 7C, GAL4-Glu) . It was verified 
that this fusion construct was indeed expressed by 
performing electrophoretic mobility shift analysis of 

25 nuclear extracts obtained from K562 cells transfected 
with this construct and a radiolabelled oligonucleotide 
probe containing the GAL4 operator sequence. The GAL4 
operator probe was shifted by a GAL 4 : GLU fusion protein 
indicating that this fusion protein was made (data not 

3 0 shown) . The reporter vector pGAL4 5 E IB CAT transfected in 
the absence of GAL4-TIG-1 or with the expression vector 
for the GAL 4 DNA binding domain did not produce any cat 
gene expression (Figure 7B and 7C: neg and GAL4 ) . 
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Strikingly, TPA induced K562 cells cotransf ected 
with the CAT reporter construct and the GAL4-TIG-1 
expression vector increased cat gene expression by 11-14 
fold as compared to uninduced cells, after normalization 
5 for transfection efficiency and the non-specific effects 
of TPA, as shown in Figure 7. panel C: GAL4-TIG-1 open 
bar. No such effect of TPA induction on K562 cells was 
found with any of the other plasmids cotransf ected in 
this experiment (Figure 7C except for GAL4-TIG-1) . This 

10 result suggests that TPA induction of protein kinase C 
activity may result in regulation of the transcriptional 
activity of the TIG-1 protein. A putative protein kinase 
C phosphorylation site is found at amino acids 428-43 0 of 
the deduced open reading frame as depicted in Figure 4 . 

15 The results of the cotransf ection assays suggest 

that the TIG-1 protein could function as a 
transcriptional activating factor, when tethered to DNA. 
Since a specific DNA binding domain could not be 
deduced, it is hypothesized that the TIG-1 protein may 

20 function as a transcriptional coactivator factor 
especially following induction of protein kinase C 
activity. Glutamine rich proteins have been observed as 
coactivators of transcriptions such as TAF110 (Hoey et. 
al . ) . Indeed, there are two classes of glutamine rich 

25 transcriptional regulatory proteins; one class including 
proteins such as TAF110 and TIG-1 contain clusters of 
glutamine residues, the other class including such 
proteins as nuclear respiratory factors 1 and 2 and Spl 
have glutamine containing clusters of hydrophobic 

30 residues (Gugneja et . al . 1996, Gill et. al. 1994). 

Glutamine rich proteins can form stable oligomers with 
other glutamine rich proteins as the polyglutamine tracts 
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form S-pleated sheets that can form polar zippers (Stott 
et. al . 1995, Perutz, 1996). 

Tjian's group identified a co-activator complex (ARC) 
that mediates chromatin -directed transcriptional 
activation. Naar et . al . identified one of the members 
of the ARC complex as a polypeptide of approximate 
molecular weight 105 kDa . Microsequence analysis of this 
polypeptide yielded two peptide fragments that were 
identical to peptides corresponding to the deduced open 
reading frame of the TIG-1 mRMA. The two peptides 
fragments were identical to amino acids 13-22 and 501-512 
of the deduced open reading frame of TIG-1 mRNA (Naar et . 
al. 1999) . 

Using the affinity purified rabbit antiserum to the 
carboxy terminal 185 amino acids of TIG-1 protein 
reported in this paper, Naar et . al . was able to 
immunoprecipitate the ARC complex and demonstrate that it 
was identical to the ARC complex that was purified using 
(affinity chromatography for) the transcriptional 
activation domain of VP-16. 

Highly related, if not identical to the ARC complex, 
is the DRIP complex identified by Freedman's group 
(Rachez et . al . 1999). The DRIP complex helps to mediate 
ligand-dependent transcription by such nuclear activators 
as the vitamin D and thyroid hormone receptors. The ARC 
and DRIP complexes share many proteins in common; the 
DRIP complex prepared from HeLa cell nuclear extracts 
contains the TIG-1 protein (Naar et . al., 1999). 

These transcriptional activator complexes appear to 
be found in all cells; furthermore there appears to be 
equal stoichiometry of the proteins within each complex. 
Though TIG-1 mRNA appears to be ubiquitously expressed, 
data demonstrates that at least at the level of mRNA 
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expression there is substantial variation in the amount 
of TIG-1 mRNA with highest levels of expression in 
placenta and bone marrow derived tissues. If indeed 
there is concomitant variation in expression of TIG-1 
5 protein, it is possible that TIG-1 protein may 

participate in multiple complexes or in particular 
tissues to function in a tissue specific manner. 

Whether TIG-1 protein directly interacts with 
transcriptional activators or serves some other function 

10 within the transcriptional activator complex remains to 
be determined. Which members of ARC interact directly 
interact with TIG-1 protein may be answered by expression 
of the TIG-1 protein in a yeast two hybrid system. 

Further studies to determine whether the activity of 

15 the TIG-1 protein is regulated by phosphorylation are 
warranted based on the transfection data. Whether the 
TIG-1 protein serves a unique function in the cell, or 
can be replaced by other members of the ARC complex will 
require gene knockout experiments. 

2 0 It is of interest that the TIG-1 mRNA encoding for 

a protein found to be part of a chromatin altering 
complex was independently identified using a subtractive 
hybridization protocol aimed at identifying novel genes 
encoding for proteins involved in differentiation of the 

25 human leukemia cell line K562. Expression of TIG-1 

mRNA was noted in hemin induced K562 cells, however, its 
expression was increased 2.5 fold when the K562 cells 
were induced with TPA. Since the phenotype of the K562 
cell switches from erythroid to megakaryocyte id it is 

30 possible that the TIG-1 protein/ARC complex is required 
for remodeling of chromatin structure at megakaryocytic 
specific promoters and enhancement of gene expression by 
a tissue specific subset of transcription factors. 
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Indeed, induction of K562 cells with hemin results in a 
very specific alteration of chromatin structure 
especially within and around the K562 cell IS-globin gene 
domain- including alteration at the locus control region 
5 (LCR) leading to expression of epsilon and gamma globin 
genes (Tuan et. al . , 1985); induction of K562 cells with 
TPA does not result in alteration of the chromatin 
structure of the JS-globin LCR, extinguishes erythroid 
specific gene expression, while activating expression of 

10 other sets of genes -presumably requiring alteration of 
the chromatin structure, possibly by ARC /DRIP at the 
promoter/ enhancers of these genes. Our data suggest that 
there is variation in the level of TIG-1 mRNA expression 
in tissues with highest levels of expression in the 

15 placenta and bone marrow derived tissues; whether TIG-1 
protein participates complexes other than ARC/DRIP 
remains to be determined. 

Transfection of the expression vector encoding for 
the GAL4:TIG-1 fusion protein into K562 cells did result 

20 in some change in K562 cell morphology. Whether the TIG-1 
protein plays a role in the TPA induced megakaryocytoid 
differentiation of K562 cells remains to be determined. 

Although preferred embodiments have been depicted 
and described in detail herein, it will be apparent to 

2 5 those skilled in the relevant art that various 

modifications, additions, substitutions and the like can 
be made without departing from the spirit of the 
invention and these are therefore considered to be within 
the scope of the invention as defined in the claims which 

30 follow. 
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What Is Claimed: 



1 1 . An isolated nucleic acid molecule having a 

2 nucleotide sequence as shown in SEQ ID N0:1. 

1 2. The isolated nucleic acid molecule of claim 1 

2 wherein said nucleic acid molecule encodes an amino acid 

3 sequence as shown in SEQ ID NO: 3. 

1 3. The isolated nucleic acid molecule of claim 1 

2 wherein said nucleic acid is deoxyribonucleic acid. 

1 4. The isolated nucleic acid molecule of claim 3 

2 wherein said deoxyribonucleic acid is cDMA. 

1 5. The isolated nucleic acid molecule of claim 1 

2 wherein said nucleic acid is ribonucleic acid. 

1 6. The isolated nucleic acid molecule of claim 5 

2 wherein said ribonucleic acid is mRNA. 

3 7. The isolated nucleic acid molecule of claim 1 

4 wherein said nucleic acid encodes a transcriptional 

5 activity. 

1 8 . An oligonucleotide complementary to at least a 

2 portion of the mRNA of claim 6. 

1 9. A cell comprising the oligonucleotide of claim 

2 8 . 

1 10. An expression vector comprising the 

2 oligonucleotide of claim 8. 
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1 11. The expression vector of claim 10 wherein the 

2 expression vector is selected from the group consisting 

3 of a plasmid and a virus. 

1 12 . A cell comprising the expression vector of 

2 claim 10. 

1 13. A method of decreasing expression of a 

2 transcriptional activator protein in a host cell, said 

3 method comprising introducing the oligonucleotide of 

4 claim 8 into the cell, wherein said oligonucleotide 

5 blocks translation of said mRNA so as to decrease 

6 expression of said transcriptional activator protein in 

7 said host cell . 

1 14 . A cell comprising the nucleic acid molecule of 

2 claim 1 . 

1 15 . An expression vector comprising the nucleic 

2 acid molecule of claim 1. 

1 16. The expression vector of claim 15 wherein said 

2 expression vector is selected from the group consisting 

3 of a plasmid and a virus. 

1 17 . A cell comprising the expression vector of 

2 claim 15. 

1 18. A method of increasing expression of 

2 transcriptional activator protein in a host cell, said 

3 method comprising: 
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4 introducing the nucleic acid molecule of claim 1 

5 into the cell; and 

6 allowing said cell to express said nucleic acid 

7 molecule resulting in the production of transcriptional 

8 activator protein in said cell. 

1 19. A method of screening a substance for the 

2 ability of the substance to modify transcriptional 

3 activator protein function, said method comprising: 

4 introducing the nucleic acid molecule of claim 1 

5 into a host cell; 

6 expressing said transcriptional activator protein 

7 encoded by said nucleic acid molecule in the host cell; 

8 exposing the cell to a substance; and 

9 evaluating the exposed cell to determine if the 

10 substance modifies the function of the transcriptional 

11 activator protein. 

1 20. The method of claim 19 wherein said evaluation 

2 comprises monitoring the expression of transcriptional 

3 activator protein. 

1 21. A method of obtaining DNA encoding a 

2 transcriptional activator protein, said method 

3 comprising: 

4 selecting a DNA molecule encoding a transcriptional 

5 activator protein, said DNA molecule having a nucleotide 

6 sequence as shown in SEQ ID NO : 1 ; 

7 designing an oligonucleotide probe for a 

8 transcriptional activator protein based on the nucleotide 

9 sequence of the selected DNA molecule; 

10 probing a genomic or cDNA library of an organism 

11 with the oligonucleotide probe; and 
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12 obtaining clones from said library that are 

13 recognized by said oligonucleotide probe, so as to obtain 

14 DNA encoding a transcriptional activator protein. 

1 22 . A method of obtaining DNA encoding a 

2 transcriptional activator protein, said method 

3 comprising: 

4 selecting a DNA molecule encoding a transcriptional 

5 activator protein, said DNA molecule having a nucleotide 

6 sequence as shown in SEQ ID NO : 1 ; 

7 designing degenerate oligonucleotide primers based 

8 on the nucleotide sequence of the selected DNA molecule; 

9 and 

10 utilizing said oligonucleotide primers in a 

11 polymerase chain reaction on a DNA sample to identify 

12 homologous DNA encoding a transcriptional activator 

13 protein in said sample. 

1 23. An isolated nucleic acid molecule encoding a 

2 transcriptional activator protein, said nucleic acid 

3 molecule encoding a first amino acid sequence having at 

4 least 90% amino acid identity to a second amino acid 

5 sequence, said second amino acid sequence as shown in SEQ 

6 ID NO : 3 . 

1 24 . A DNA oligomer capable of hybridizing to the 

2 nucleic acid molecule of claim 1. 

1 25. A method of detecting presence of a 

2 transcriptional activator protein in a sample, said 

3 method comprising: 

4 contacting a sample with the DNA oligomer of claim 

5 24, wherein said DNA oligomer hybridizes to any of said 
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6 transcriptional activator protein present in said sample, 

7 forming a complex therewith; and 

8 detecting said complex, thereby detecting presence 

9 of a transcriptional activator protein in said sample. 

1 26. The method of claim 25 wherein said DNA 

2 oligomer is labeled with a detectable marker. 

1 27. An isolated protein, wherein said protein is 

2 encoded by a nucleotide sequence as shown in SEQ ID N0:1. 

1 28. The protein of claim 27 wherein said protein 

2 has transcriptional activator activity. 

1 29. The protein of claim 27 wherein said protein is 

2 encoded by an amino acid sequence as shown in SEQ ID 

3 NO : 3 . 

1 30. An isolated protein encoded by a first amino 

2 acid sequence having at least 90% amino acid identity to 

3 a second amino acid sequence, said second amino acid 

4 sequence as shown in SEQ ID NO : 3 . 

1 31. An antibody or fragment thereof specific for 

2 the protein of claim 30. 

1 32. The antibody of claim 31 wherein said antibody 

2 comprises a monoclonal antibody. 

1 33. The antibody of claim 31 wherein said antibody 

2 comprises a polyclonal antibody. 
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1 34. A method of detecting presence of a 

2 transcriptional activator protein in a sample, said 

3 method comprising: 

4 contacting a sample with the antibody or fragment 

5 thereof of claim 31, wherein said antibody or fragment 

6 thereof binds to any of said transcriptional activator 

7 protein present in said sample, forming a complex 

8 therewith; and 

9 detecting said complex, thereby detecting presence 
10 of a transcriptional activator protein in said sample. 

1 35. The method of claim 34 wherein said antibody or 

2 fragment thereof is labeled with a detectable marker. 

1 36. A method of producing an antibody specific for 

2 a transcriptional activator protein in a host, the method 

3 comprising: 

4 selecting the isolated transcriptional activator 

5 protein of claim 27 or an antigenic portion thereof; and 

6 introducing the selected transcriptional activator 

7 protein or antigenic portion thereof into a host to 

8 induce production of an antibody specific for 

9 transcriptional activator protein in the host. 
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ABSTRACT OF THE DISCLOSURE 

The present invention is directed to isolated 
5 nucleic acid molecules encoding protein, wherein the 
protein has transcriptional activator activity. 
Expression vectors and host cells comprising the nucleic 
acid molecules are also provided, as well as methods for 
increasing or decreasing the expression of the 

10 transcriptional activator protein in host cells. The 
invention further provides methods of screening a 
substance for the ability of the substance to modify 
transcriptional activator protein function, and a method 
for isolating other transcriptional activator protein 

15 molecules. DNA oligomers capable of hybridizing to the 
nucleic acid molecule encoding the transcriptional 
activator protein are provided, which can be used to 
detect transcriptional activator protein in a sample. 
Antibodies specific for the transcriptional activator 

2 0 protein, and fragments thereof, are provided. 
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SEQUENCE LISTING 



Sequence ID NO. 1 



_atgagg aaagctggtg 

tggcacacag taaatccagc aaggatatgg agagccatgt tttcctgaag gccaagaccc 
gggacgaata cctttctctc gtggccaggc tcattatcca ttttcgagac attcataaca 
agaaatctca agcttccgtc agtgatccta tgaatgcact ccagagcctg actggcggac 
ctgctgcggg agccgctgga attggcatgc ctcctcgggg cccgggacag tctctgggcg 
ggatgggtag ctttggtgcc atgggacagc caatgtctct ctcagggcag ccgcctcctg 
ggacctcggg gatggcccct cacagcatgg ctgtcgtgtc tacggcaact ccacagaccc 
agctgcagct ccagcaggtg gcgctgcagc agcagcagca acagcagcag ttccagcagc 
agcagcaggc ggcgctacag cagcagcagc agcagcagca acagcagcag ttccaggctc 
agcagagtgc catgcagcag cagttccaag cagtagtgca gcagcagcag cagctccagc 
agcagcagca gcagcagcag catctaatta aattgcatca tcaaaatcag caacagatac 
agcagcagca acagcagctg cagcgaatag cacagctgca gctccaacaa cagcaacagc 
agcagcagca gcagcagcag cagcagcagc aggctttgga ggcccagcca ccaattcagc 
agccaccgat gcagcagcca cagcctccgc cctcccaggc tctgccccag cagctgcagc 
agatgcatca cacacagcac caccagccgc caccacagcc ccagcagcct ccagttgctc 
agaaccaacc atcacaactc ccgccacagt cgcagaccca gcctttggtg tcacaggcgc 
aagctctccc tggacaaatg ttgtataccc aaccaccact gaaatttgtc cgagctccga 



tggtggtgca gcagccccca gtgcagcccc aggtgcagca gcagcagaca gcagtacaga 
cagctcaggc tgcccagatg gtggctcccg gagtccaggt cagccagagc agcctcccca 
tgctgtcctc gccgtcaccg ggccagcagg tgcagacccc gcagtcgatg ccccctcccc 

cccagccgtc cccgcagccc ggccagccca gctcacagcc caactccaac gtcagctctg 
gccctgcccc atctcccagt agcttcctgc ccagcccctc accgcagccc tcccagagcc 
cagtgacggc gcggacccca cagaacttca gtgtcccctc acctggacct ttaaacacac 
ctgtgaaccc cagctctgtc atgagcccag ctggctccag ccaggctgag gagcagcagt 
acctggacaa gctgaagcag ctgtcgaagt acatcgagcc cctgcgccgc atgatcaaca 
agatcgacaa gaacgaagac agaaaaaagg acctgagtaa gatgaagagc cttctggaca 
ttctgacaga cccctcgaag cggtgtcccc tgaagacctt gcaaaagtgt gagatcgccc 
tggagaaact caagaatgac atgcggtgcc cactccccca ccgcccccgg tgccaccgac 
caaacagcag tacctatgcc agccgctcct ggatgccgtc ctggccaaca tccgctcacc 
tgtcttcaac cattccctgt accgcacatt cgttccagcc a 



SEQUENCE ID NO. 2 



1 ccggcgcgac ttgggcctgg ctctgtgact 
61 acgggcggcg ggagctgggg aacaggcatg 
121 agaccgcctt ccggcagaag ctggtcagtc 
181 tggcacacag taaatccagc aaggatatgg 
241 gggacgaata cctttctctc gtggccaggc 
3 01 agaaatctca agcttccgtc agtgatccta 
361 ctgctgcggg agccgctgga attggcatgc 
421 ggatgggtag ctttggtgcc atgggacagc 
481 ggacctcggg gatggcccct cacagcatgg 
541 agctgcagct ccagcaggtg gcgctgcagc 
601 agcagcaggc ggcgctacag cagcagcagc 
661 agcagagtgc catgcagcag cagttccaag 
721 agcagcagca gcagcagcag catctaatta 
7 81 agcagcagca acagcagctg cagcgaatag 
841 agcagcagca gcagcagcag cagcagcagc 
901 agccaccgat gcagcagcca cagcctccgc 
961 agatgcatca cacacagcac caccagccgc 
1021 agaaccaacc atcacaactc ccgccacagt 
1081 aagctctccc tggacaaatg ttgtataccc 



gaggcggcgg cggtggcggc caagcgggat 
gacgtttccg ggcaagagac cgactggcgg 
aaatcgagga tgccatgagg aaagctggtg 
agagccatgt tttcctgaag gccaagaccc 
tcattatcca ttttcgagac attcataaca 
tgaatgcact ccagagcctg actggcggac 
ctcctcgggg cccgggacag tctctgggcg 
caatgtctct ctcagggcag ccgcctcctg 
ctgtcgtgtc tacggcaact ccacagaccc 
agcagcagca acagcagcag ttccagcagc 
agcagcagca acagcagcag ttccaggctc 
cagtagtgca gcagcagcag cagctccagc 
aattgcatca tcaaaatcag caacagatac 
cacagctgca gctccaacaa cagcaacagc 
aggctttgga ggcccagcca ccaattcagc 
cctcccaggc tctgccccag cagctgcagc 
caccacagcc ccagcagcct ccagttgctc 
cgcagaccca gcctttggtg tcacaggcgc 
aaccaccact gaaatttgtc cgagctccga 



1141 tggtggtgca gcagccccca gtgcagcccc 
1201 cagctcaggc tgcccagatg gtggctcccg 
1261 tgctgtcctc gccgtcaccg ggccagcagg 

1321 cccagccgtc cccgcagccc ggccagccca 
13 81 gccctgcccc atctcccagt agcttcctgc 
1441 cagtgacggc gcggacccca cagaacttca 
1501 ctgtgaaccc cagctctgtc atgagcccag 
15 61 acctggacaa gctgaagcag ctgtcgaagt 
1621 agatcgacaa gaacgaagac agaaaaaagg 
1681 ttctgacaga cccctcgaag cggtgtcccc 
1741 tggagaaact caagaatgac atgcggtgcc 
18 01 caaacagcag tacctatgcc agccgctcct 
1861 tgtcttcaac cattccctgt accgcacatt 
1921 acccatcacg gccccagtgg tgtgcacccg 
1981 gagcatcccc agtgtgctcc agggtgaggt 
2041 cctggaccct tctcactgca gcaacaatgg 
2101 caaggacctc ccaagtgtgc caccactgga 
2161 aagcccgctg tggatagacc ggcagtggca 
2221 tgcaccgctg catgacctcc aggctgctgc 
2281 tgctcaacac ctgggcccag agcgtccacc 
2 341 tgcagggatg gcccgcagcc tcatcggggc 
2401 taggtgttgg cttccttaga gagcctgggg 
2461 ccttggggac ctgccaaacg aaatcccaca 

2 521 gcgggttgct tggggggcgt tggccgactt 
2581 ccaggagcca gatgcgatcc tcaggctgct 
2641 atctcagcag cgtgagggtg cactcagggt 
2701 cacccctact cgttcctata gaacacagag 
2761 attctctggt cacagttttg ggttcaggct 
2821 gaggaagcct gcaagtccag ggcacaggct 
2881 gctgctggct ccccgccacc agctgggcct 
2941 tggggcaccc agggagcagg ggcgtcaggg 
3001 catgagggcc gtgtccccac tgtgaaggat 

3 0 61 cagagcacca cacactgagc acccagagac 
3121 gggagcgcct ccctatgttg cctgccactc 
3181 gaggtccccc ctttttatgt gcactacccc 
3 241 cctgtggaaa aaaaaaaaaa aaaaaaaaaa 
3301 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 



aggtgcagca gcagcagaca gcagtacaga 
gagtccaggt cagccagagc agcctcccca 
tgcagacccc gcagtcgatg ccccctcccc 

gctcacagcc caactccaac gtcagctctg 
ccagcccctc accgcagccc tcccagagcc 
gtgtcccctc acctggacct ttaaacacac 
ctggctccag ccaggctgag gagcagcagt 
acatcgagcc cctgcgccgc atgatcaaca 
acctgagtaa gatgaagagc cttctggaca 
tgaagacctt gcaaaagtgt gagatcgccc 
cactccccca ccgcccccgg tgccaccgac 
ggatgccgtc ctggccaaca tccgctcacc 
cgttccagcc atgaccgcca ttcacggccc 
gaagcgcagg cttgaggatg atgagcggca 
ggccaggctg gaccccaagt tcctggtaaa 
cactgtccac ctgatctgca agctggatga 
gctcagtgtg cccgctgact atcctgccca 
gtacgacgcc aacccctttc ctccagtcgg 
agctcccgga caagcactcg gtcaccgcct 
aggcctgcct ctcagccgcc tagccaagac 
caaggacaca cgcctcctgt cagacacttc 
ttaggttagc tttcctgctt ttatcttctg 
cctgtacaga actgggatag gcgcagtgga 
cttagagaag gccctccatg tgacttcctc 
ctcaccgtgg cctgtccacg gtccaggtcc 
gttgttagag cgtctcgtgt gtgctagacg 
gacataggaa acccttaaaa cacacatggg 
atgctgcttt gggcaggtgg agcacccccc 
gccttttgga gggagggctg gcccataggt 
cagccctcac ggcattcctg ctgagcaccg 
atcctgctgc cggcacccct gtgccgctgg 
gaagagcaag gccctcagga cccgtgtcct 
agcgggcttg gcagcgggcc gggccatgca 
tgggcaccgg ccagcaccct ctggtgagaa 
accatctgtg attataataa atttattatt 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
aaaa 



SEQUENCE ID NO 3 



"MRKAGVAHSKSSKDMESHVFLKAKTRDEYLSLVARLIIHFRDIH 
NIOCSQASVSDPMNALQSLTGGPAAGAAGIGMPPRGPGQSLGGMGSFGAMGQPMSLSGQ 
PPPGTSGMAPHSMAWSTATPQTQLQLQQVALQQQQQQQQFQQQQQAALQQQQQQQQQ 
QQFQAQQSAMQQQFQAWQQQQQLQQQQQQQQHLIKLHHQNQQQIQQQQQQLQRIAQL 
QLQQQQQQQQQQQQQQQQALEAQPP I QQPPMQQPQPP P S QALPQQLQQMHHTQHHQPP 
PQPQQPPVAQNQPSQLPPQSQTQPLVSQAQALPGQMLYTQPPLKFVRAPMWQQPPVQ 
PQVQQQQTAVQTAQAAQMVAPGVQVSQSSLPMLSSPSPGQQVQTPQSMPPPPQPSPQP 
GQPSSQPNSNVSSGPAPSPSSFLPSPSPQPSQSPVTARTPQNFSVPSPGPLNTPVNPS 
SVMSPAGSSQAEEQQYLDKLKQLSKYIEPLRRMINKIDKNEDRKKDLSKMKSLLDILT 
DPS KRCPLKTLQKCE I ALEKLKNDMRCPLPHRPRCHRPNS STYASRS WMP S WPTS AHL 
SSTIPCTAHSFQP" 



